Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Overview of the INEX 2013 Social Book Search Track

Identifieur interne : 000048 ( France/Analysis ); précédent : 000047; suivant : 000049

Overview of the INEX 2013 Social Book Search Track

Auteurs : Marijn Koolen [Pays-Bas] ; Gabriella Kazai [Royaume-Uni] ; Michael Preminger [Norvège] ; Antoine Doucet [France]

Source :

RBID : Hal:hal-01073644

Abstract

The goal of the INEX 2013 Social Book Search Track is to evaluate approaches for supporting users in reading, searching, and navigating collections of books based on book metadata, the full texts of digitised books or associated user-generated content. The investigation is focused around three tasks: 1) the Social Book Search (SBS) task investigates the complex nature of relevance in book search and the role of traditional and user-generated book metadata in retrieval, 2) the Prove It (PI) task evaluates focused retrieval approaches for searching pages in books that can confirm or refute a given factual claim, 3) the Structure Extraction (SE) task evaluates automatic techniques for deriving book structure from OCR text and layout information. Both the SBS and SE tasks have a growing number of active participants, while the PI task is only tackled by a small number of core groups. In the SBS task, we extended last year's investigation into the nature of book suggestions from the LibraryThing forums and how they compare to book relevance judgements. We found further support that such suggestions are a valuable alternative to traditional test collections that are based on top-k pooling and editorial relevance judgements. The PI task added a further relevance criterion that pages should not only confirm or refute a given factual claim, but should also come from an authoritative source that is of the appropriate genre. The relevance assessments have not yet commenced at the time of writing. The SE task has reached a record number of active participants and has, for the first time, witnessed an improvement in the state of the art.

Url:


Affiliations:


Links toward previous steps (curation, corpus...)


Links to Exploration step

Hal:hal-01073644

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Overview of the INEX 2013 Social Book Search Track</title>
<author>
<name sortKey="Koolen, Marijn" sort="Koolen, Marijn" uniqKey="Koolen M" first="Marijn" last="Koolen">Marijn Koolen</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-120654" status="VALID">
<orgName>University of Amsterdam [Amsterdam]</orgName>
<desc>
<address>
<addrLine>Spui 21 1012 WX Amsterdam</addrLine>
<country key="NL"></country>
</address>
<ref type="url">http://www.uva.nl/en/home</ref>
</desc>
<listRelation>
<relation active="#struct-303011" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-303011" type="direct">
<org type="institution" xml:id="struct-303011" status="VALID">
<orgName>University of Amsterdam</orgName>
<orgName type="acronym"> UvA</orgName>
<desc>
<address>
<country key="NL"></country>
</address>
<ref type="url">https://www.uva.nl/en/home</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>Pays-Bas</country>
</affiliation>
</author>
<author>
<name sortKey="Kazai, Gabriella" sort="Kazai, Gabriella" uniqKey="Kazai G" first="Gabriella" last="Kazai">Gabriella Kazai</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-15503" status="VALID">
<orgName>Microsoft Research [Cambridge]</orgName>
<orgName type="acronym">Microsoft</orgName>
<desc>
<address>
<addrLine>Roger Needham Building 7 J J Thomson Ave Cambridge CB3 0FB, UK</addrLine>
<country key="GB"></country>
</address>
<ref type="url">http://research.microsoft.com/aboutmsr/labs/cambridge/default.aspx</ref>
</desc>
<listRelation>
<relation active="#struct-365620" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-365620" type="direct">
<org type="institution" xml:id="struct-365620" status="VALID">
<orgName>Microsoft Research</orgName>
<desc>
<address>
<country key="US"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>Royaume-Uni</country>
</affiliation>
</author>
<author>
<name sortKey="Preminger, Michael" sort="Preminger, Michael" uniqKey="Preminger M" first="Michael" last="Preminger">Michael Preminger</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-268127" status="INCOMING">
<orgName>Oslo and Akershus University College of Applied Sciences</orgName>
<desc>
<address>
<country key="NO"></country>
</address>
</desc>
<listRelation>
<relation active="#struct-380120" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-380120" type="direct">
<org type="institution" xml:id="struct-380120" status="INCOMING">
<orgName>Oslo and Akershus University College of Applied Sciences</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>Norvège</country>
</affiliation>
</author>
<author>
<name sortKey="Doucet, Antoine" sort="Doucet, Antoine" uniqKey="Doucet A" first="Antoine" last="Doucet">Antoine Doucet</name>
<affiliation wicri:level="1">
<hal:affiliation type="researchteam" xml:id="struct-388300" status="VALID">
<orgName>Equipe Hultech - Laboratoire GREYC - UMR6072</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
<listRelation>
<relation active="#struct-150" type="direct"></relation>
<relation name="UMR6072" active="#struct-441569" type="indirect"></relation>
<relation active="#struct-300358" type="indirect"></relation>
<relation active="#struct-300266" type="indirect"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-150" type="direct">
<org type="laboratory" xml:id="struct-150" status="VALID">
<orgName>Groupe de Recherche en Informatique, Image, Automatique et Instrumentation de Caen</orgName>
<orgName type="acronym">GREYC</orgName>
<desc>
<address>
<addrLine>Boulevard du Maréchal Juin - 14050 CAEN Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.greyc.fr</ref>
</desc>
<listRelation>
<relation name="UMR6072" active="#struct-441569" type="direct"></relation>
<relation active="#struct-300358" type="direct"></relation>
<relation active="#struct-300266" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle name="UMR6072" active="#struct-441569" type="indirect">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="IdRef">02636817X</idno>
<idno type="ISNI">0000000122597504</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-300358" type="indirect">
<org type="institution" xml:id="struct-300358" status="VALID">
<orgName>Ecole Nationale Supérieure d'Ingénieurs de Caen</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle active="#struct-300266" type="indirect">
<org type="institution" xml:id="struct-300266" status="INCOMING">
<orgName>Université de Caen Basse-Normandie</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">Caen</settlement>
<region type="region" nuts="2">Basse-Normandie</region>
</placeName>
<orgName type="university">Université de Caen Basse-Normandie</orgName>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">HAL</idno>
<idno type="RBID">Hal:hal-01073644</idno>
<idno type="halId">hal-01073644</idno>
<idno type="halUri">https://hal.archives-ouvertes.fr/hal-01073644</idno>
<idno type="url">https://hal.archives-ouvertes.fr/hal-01073644</idno>
<date when="2013-09-23">2013-09-23</date>
<idno type="wicri:Area/Hal/Corpus">000097</idno>
<idno type="wicri:Area/Hal/Curation">000097</idno>
<idno type="wicri:Area/Hal/Checkpoint">000044</idno>
<idno type="wicri:Area/Main/Merge">000146</idno>
<idno type="wicri:Area/Main/Curation">000144</idno>
<idno type="wicri:Area/Main/Exploration">000144</idno>
<idno type="wicri:Area/France/Extraction">000048</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">Overview of the INEX 2013 Social Book Search Track</title>
<author>
<name sortKey="Koolen, Marijn" sort="Koolen, Marijn" uniqKey="Koolen M" first="Marijn" last="Koolen">Marijn Koolen</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-120654" status="VALID">
<orgName>University of Amsterdam [Amsterdam]</orgName>
<desc>
<address>
<addrLine>Spui 21 1012 WX Amsterdam</addrLine>
<country key="NL"></country>
</address>
<ref type="url">http://www.uva.nl/en/home</ref>
</desc>
<listRelation>
<relation active="#struct-303011" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-303011" type="direct">
<org type="institution" xml:id="struct-303011" status="VALID">
<orgName>University of Amsterdam</orgName>
<orgName type="acronym"> UvA</orgName>
<desc>
<address>
<country key="NL"></country>
</address>
<ref type="url">https://www.uva.nl/en/home</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>Pays-Bas</country>
</affiliation>
</author>
<author>
<name sortKey="Kazai, Gabriella" sort="Kazai, Gabriella" uniqKey="Kazai G" first="Gabriella" last="Kazai">Gabriella Kazai</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-15503" status="VALID">
<orgName>Microsoft Research [Cambridge]</orgName>
<orgName type="acronym">Microsoft</orgName>
<desc>
<address>
<addrLine>Roger Needham Building 7 J J Thomson Ave Cambridge CB3 0FB, UK</addrLine>
<country key="GB"></country>
</address>
<ref type="url">http://research.microsoft.com/aboutmsr/labs/cambridge/default.aspx</ref>
</desc>
<listRelation>
<relation active="#struct-365620" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-365620" type="direct">
<org type="institution" xml:id="struct-365620" status="VALID">
<orgName>Microsoft Research</orgName>
<desc>
<address>
<country key="US"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>Royaume-Uni</country>
</affiliation>
</author>
<author>
<name sortKey="Preminger, Michael" sort="Preminger, Michael" uniqKey="Preminger M" first="Michael" last="Preminger">Michael Preminger</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-268127" status="INCOMING">
<orgName>Oslo and Akershus University College of Applied Sciences</orgName>
<desc>
<address>
<country key="NO"></country>
</address>
</desc>
<listRelation>
<relation active="#struct-380120" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-380120" type="direct">
<org type="institution" xml:id="struct-380120" status="INCOMING">
<orgName>Oslo and Akershus University College of Applied Sciences</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>Norvège</country>
</affiliation>
</author>
<author>
<name sortKey="Doucet, Antoine" sort="Doucet, Antoine" uniqKey="Doucet A" first="Antoine" last="Doucet">Antoine Doucet</name>
<affiliation wicri:level="1">
<hal:affiliation type="researchteam" xml:id="struct-388300" status="VALID">
<orgName>Equipe Hultech - Laboratoire GREYC - UMR6072</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
<listRelation>
<relation active="#struct-150" type="direct"></relation>
<relation name="UMR6072" active="#struct-441569" type="indirect"></relation>
<relation active="#struct-300358" type="indirect"></relation>
<relation active="#struct-300266" type="indirect"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-150" type="direct">
<org type="laboratory" xml:id="struct-150" status="VALID">
<orgName>Groupe de Recherche en Informatique, Image, Automatique et Instrumentation de Caen</orgName>
<orgName type="acronym">GREYC</orgName>
<desc>
<address>
<addrLine>Boulevard du Maréchal Juin - 14050 CAEN Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.greyc.fr</ref>
</desc>
<listRelation>
<relation name="UMR6072" active="#struct-441569" type="direct"></relation>
<relation active="#struct-300358" type="direct"></relation>
<relation active="#struct-300266" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle name="UMR6072" active="#struct-441569" type="indirect">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="IdRef">02636817X</idno>
<idno type="ISNI">0000000122597504</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-300358" type="indirect">
<org type="institution" xml:id="struct-300358" status="VALID">
<orgName>Ecole Nationale Supérieure d'Ingénieurs de Caen</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle active="#struct-300266" type="indirect">
<org type="institution" xml:id="struct-300266" status="INCOMING">
<orgName>Université de Caen Basse-Normandie</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">Caen</settlement>
<region type="region" nuts="2">Basse-Normandie</region>
</placeName>
<orgName type="university">Université de Caen Basse-Normandie</orgName>
</affiliation>
</author>
</analytic>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass></textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">The goal of the INEX 2013 Social Book Search Track is to evaluate approaches for supporting users in reading, searching, and navigating collections of books based on book metadata, the full texts of digitised books or associated user-generated content. The investigation is focused around three tasks: 1) the Social Book Search (SBS) task investigates the complex nature of relevance in book search and the role of traditional and user-generated book metadata in retrieval, 2) the Prove It (PI) task evaluates focused retrieval approaches for searching pages in books that can confirm or refute a given factual claim, 3) the Structure Extraction (SE) task evaluates automatic techniques for deriving book structure from OCR text and layout information. Both the SBS and SE tasks have a growing number of active participants, while the PI task is only tackled by a small number of core groups. In the SBS task, we extended last year's investigation into the nature of book suggestions from the LibraryThing forums and how they compare to book relevance judgements. We found further support that such suggestions are a valuable alternative to traditional test collections that are based on top-k pooling and editorial relevance judgements. The PI task added a further relevance criterion that pages should not only confirm or refute a given factual claim, but should also come from an authoritative source that is of the appropriate genre. The relevance assessments have not yet commenced at the time of writing. The SE task has reached a record number of active participants and has, for the first time, witnessed an improvement in the state of the art.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>France</li>
<li>Norvège</li>
<li>Pays-Bas</li>
<li>Royaume-Uni</li>
</country>
<region>
<li>Basse-Normandie</li>
</region>
<settlement>
<li>Caen</li>
</settlement>
<orgName>
<li>Université de Caen Basse-Normandie</li>
</orgName>
</list>
<tree>
<country name="Pays-Bas">
<noRegion>
<name sortKey="Koolen, Marijn" sort="Koolen, Marijn" uniqKey="Koolen M" first="Marijn" last="Koolen">Marijn Koolen</name>
</noRegion>
</country>
<country name="Royaume-Uni">
<noRegion>
<name sortKey="Kazai, Gabriella" sort="Kazai, Gabriella" uniqKey="Kazai G" first="Gabriella" last="Kazai">Gabriella Kazai</name>
</noRegion>
</country>
<country name="Norvège">
<noRegion>
<name sortKey="Preminger, Michael" sort="Preminger, Michael" uniqKey="Preminger M" first="Michael" last="Preminger">Michael Preminger</name>
</noRegion>
</country>
<country name="France">
<region name="Basse-Normandie">
<name sortKey="Doucet, Antoine" sort="Doucet, Antoine" uniqKey="Doucet A" first="Antoine" last="Doucet">Antoine Doucet</name>
</region>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/France/Analysis
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000048 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/France/Analysis/biblio.hfd -nk 000048 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    France
   |étape=   Analysis
   |type=    RBID
   |clé=     Hal:hal-01073644
   |texte=   Overview of the INEX 2013 Social Book Search Track
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024